Batch effect correction methods for NASA GeneLab transcriptomic datasets

نویسندگان

چکیده

Introduction: RNA sequencing (RNA-seq) data from space biology experiments promise to yield invaluable insights into the effects of spaceflight on terrestrial biology. However, sample numbers each study are low due limited crew availability, hardware, and space. To increase statistical power, RNA-seq datasets different missions often aggregated together. this can introduce technical variation or “batch effects”, differences in handling, processing, platforms. Several computational methods have been developed correct for batch effects, thereby reducing their impact true biological signals. Methods: In study, we combined 7 mouse liver NASA GeneLab (part Open Science Data Repository) evaluate several common effect correction (ComBat ComBat-seq sva R package, Median Polish, Empirical Bayes, ANOVA MBatch package). Principal component analysis (PCA) was used identify library preparation method mission as primary sources among variables dataset. We next quantitatively evaluated ability indicated identified variable using following criteria: BatchQC, PCA, dispersion separability criterion, log fold change correlation, differential gene expression analysis. Each variable/correction combination then assessed a custom scoring approach optimal dataset, by geometrically probing all allowable functions an aggregate volume-based measure. Results Discussion: Using described dataset variable/ComBat pair out ranked other candidate pairs, suggesting that should be corrected ComBat prior downstream describe multi-study visualization portal which will allow users access publicly available ‘omics data, select multiple studies combine analysis, examine presence absence metrics. If user chooses perform correction, here implemented use specific

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

islanding detection methods for microgrids

امروزه استفاده از منابع انرژی پراکنده کاربرد وسیعی یافته است . اگر چه این منابع بسیاری از مشکلات شبکه را حل می کنند اما زیاد شدن آنها مسائل فراوانی برای سیستم قدرت به همراه دارد . استفاده از میکروشبکه راه حلی است که علاوه بر استفاده از مزایای منابع انرژی پراکنده برخی از مشکلات ایجاد شده توسط آنها را نیز منتفی می کند . همچنین میکروشبکه ها کیفیت برق و قابلیت اطمینان تامین انرژی مشترکان را افزایش ...

15 صفحه اول

Error correction for massive datasets

The paper is concerned with the problem of automatic detection and correction of errors into massive data sets. As customary, erroneous data records are detected by formulating a set of rules. Such rules are here encoded into linear inequalities. This allows to check the set of rules for inconsistencies and redundancies by using a polyhedral mathematics approach. Moreover, it allows to correct ...

متن کامل

Two New Methods of Boundary Correction for Classifying Textural Images

With the growth of technology, supervising systems are increasingly replacing humans in military, transportation, medical, spatial, and other industries. Among these systems are machine vision systems which are based on image processing and analysis. One of the important tasks of image processing is classification of images into desirable categories for the identification of objects or their sp...

متن کامل

Selection Methods for Multidimensional Datasets

One of the most challenging problems encountered when analysing multidimensional datasets is overabundance of features. In order to remove irrelevant, redundant, and noisy information from the data, some feature selection algorithms are used. The two-level feature selection analysed method contains Wrapper and Filter based feature search and evaluation. The experiments were performed on a datab...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Frontiers in Astronomy and Space Sciences

سال: 2023

ISSN: ['2296-987X']

DOI: https://doi.org/10.3389/fspas.2023.1200132